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Methylation of cytosine is a DNA modification associated with gene repression. Recently, a novel cytosine modifi- 
cation, 5-hydroxymethylcytosine (5-hmC) has been discovered. Here we examine 5-hmC distribution during mam- 
malian development and in cellular systems, and show that the developmental dynamics of 5-hmC are different 
from those of 5-methylcytosine (5-mC); in particular 5-hmC is enriched in embryonic contexts compared to adult 
tissues. A detectable 5-hmC signal appears in pre-implantation development starting at the zygote stage, where the 
paternal genome is subjected to a genome-wide hydroxylation of 5-mC, which precisely coincides with the loss of the 
5-mC signal in the paternal pronucleus. Levels of 5-hmC are high in cells of the inner cell mass in blastocysts, and 
the modification colocalises with nestin-expressing cell populations in mouse post-implantation embryos. Compared 
to other adult mammalian organs, 5-hmC is strongly enriched in bone marrow and brain, wherein high 5-hmC con- 
tent is a feature of both neuronal progenitors and post-mitotic neurons. We show that high levels of 5-hmC are not 
only present in mouse and human embryonic stem cells (ESCs) and lost during differentiation, as has been reported 
previously, but also reappear during the generation of induced pluripotent stem cells; thus 5-hmC enrichment cor- 
relates with a pluripotent cell state. Our findings suggest that apart from the cells of neuronal lineages, high levels of 
genomic 5-hmC are an epigenetic feature of embryonic cell populations and cellular pluri- and multi-lineage potency. 
To our knowledge, 5-hmC represents the first epigenetic modification of DNA discovered whose enrichment is so cell- 
type specific. 
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Introduction 

The methylation of cytosines in a CpG context is a 
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common modification in mammalian genomes, and is 
generally associated with gene repression in vivo and 
in vitro [1]. Alteration of 5-methylcytosine (5-mC) pat- 
terns during development contributes to the regulation of 
gene expression and cell specification [1-3]. In addition 
to 5-mC, a novel cytosine modification, 5-hydroxymeth- 
ylcytosine (5-hmC), has recently been found in mouse 
brain and murine embryonic stem cells (mESCs) [4, 5]. 
The conversion of 5-mC to 5-hmC is catalysed by Tet 
(Ten-eleven translocation) oncogene family member 
proteins [4, 6]. Notably, as 5-hmC is interpreted as 5-mC 
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in bisulphite sequencing [7-10], the routine method of 
mC identification, these two cytosine modifications are 
indistinguishable from each other in the vast majority of 
currently available experimental results. Therefore, there 
is a need to re-evaluate many DNA methylation data, 
taking into account the existence of this novel cytosine 
modification with its (probable) distinct functional role. 
Since it has been already reported that methyl-CpG bind- 
ing proteins do not interact with 5-hmC-containing DNA 
substrates [7, 11], these two modifications are likely to 
play distinct roles in biological systems. Although a re- 
cent report suggested the importance of Tetl in mESC 
self-renewal and inner cell mass (ICM) specification in 
early embryos [6], the biological functions and devel- 
opmental distribution of genomic 5-hmC levels have 
not been studied. Here we assessed 5-hmC distribution 
throughout mammalian development, and in adult tissues 
in vivo and in in vitro cell systems using immunochemi- 
cal methods. 

Results 

Genomic 5-hmC is enriched in embryonic and induced 
pluripotent stem cells compared to differentiated cells 

We used two commercially available anti-5-hmC 
antibodies produced by Diagenode and Active Motif 
for our analysis. Since the Diagenode antibody has not 
been characterised in immunochemistry previously, we 
first confirmed its specificity in dot blot assays using 
PCR-produced DNA fragments with all of the cytosines 
replaced by either 5-hmC or 5-mC, and total genomic 
human ESC and human dermal fibroblast (HDF) DNA 
(Supplementary information, Figure SI A). The anti- 
5-hmC antibody specifically recognised the 5-hmC- 
enriched PCR fragment and hESC genomic DNA, but 
not 5 -mC -containing or unmodified PCR fragments or 
HDF DNA. The dot blot assay exhibited relatively low 
sensitivity, and only the equivalent of 500 ng of total 
genomic DNA produced a detectable signal with the 
anti-5-hmC antibody. Since the estimated genomic pro- 
portion of 5-hmC in mESCs is relatively low (<1% total 
cytosine content) [4], we used a peroxidase-conjugated 
secondary antibody coupled with a tyramide signal en- 
hancement system for 5-hmC detection in subsequent 
immunochemical staining experiments. Under these 
conditions, the anti-5-hmC antibody produced distinctive 
nuclear staining patterns on mESCs and hESCs (Figure 
1A and Supplementary information, Figure SIB), but not 
on human and mouse cancer and immortalised cell lines 
(Supplementary information, Figure SIC). These results 
were consistent with previously reported data obtained 
by thin layer chromatography (TLC) [4, 5] and with 
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our dot blot results (Supplementary information, Figure 
SI A). We obtained essentially identical results using the 
Active Motif anti-5-hmC antibody, which has been suc- 
cessfully used in dot blots, immunochemistry and other 
applications previously [6, 9, 12, 13]. In our experiments 
5-hmC was strongly enriched in hESCs, compared to a 
very weak 5-hmC signal in HDFs (Supplementary infor- 
mation, Figure S2). Both antibodies also behaved simi- 
larly in all further experiments. We concluded that the 
Diagenode anti-5-hmC antibody specifically recognises 
5-hmC, but not 5-mC, in immunocytochemical assays. 
Notably, the distribution of 5-hmC staining in ESCs was 
strikingly different from that of 5-mC. The intensity of 
the 5-hmC signal was strong in the cells forming the 
colonies of ESCs, and was considerably weaker in sur- 
rounding differentiating fibroblast-like "stromal" cells, 
whereas 5-mC was detectable in both cell types at similar 
levels (Figure IB and Supplementary information, Figure 
SID). This is consistent with the observation that 5-hmC 
content decreases upon differentiation of murine ESCs, 
as reported previously [4, 14]. We decided to examine 
the dynamics of this process using a protocol to differen- 
tiate hESCs to hepatic endoderm [15]. The 5-hmC signal 
dropped significantly following a 3-day treatment with 
Wnt3A and Activin to induce definitive endoderm, and 
was virtually undetectable after an additional 2 days of 
differentiation (Figure 1C). Concurrent analysis of tran- 
scriptome changes by global quantitative deep sequenc- 
ing revealed a parallel reduction in the number of Oct4 
and Nanog transcripts (Figure ID). Unexpectedly, we 
observed an increase in the numbers of transcripts of all 
three Tet genes at 3 days post-induction at the time point 
where the 5-hmC signal dropped significantly (Figure 
ID). Possible explanations for this discrepancy include 
the presence of mechanisms regulating Tet proteins at 
either the post-transcriptional/post-translational levels, 
or the inhibition of their oxygenase enzymatic activity. 
Over 5 days of differentiation, the numbers of Tetl, Tetl 
and Tet 3 transcripts also decreased (Figure ID). To deter- 
mine the correspondence of 5-hmC with acquisition of a 
pluripotent state we carried out 5-hmC immunostaining 
on human induced pluripotent stem cells (iPSCs) [16] 
and their parental somatic cell lines. hiPSC lines gener- 
ated from both normal and Huntington's disease tissues 
(Supplementary information, Figure S3), but not from 
their parental HDFs, exhibited a strong 5-hmC signal 
(Figure IE and Supplementary information, Figure SID). 
In summary, our screening of in vitro cell systems for 
the presence of immunochemically detectable levels of 
5-hmC revealed a strong correlation of high levels of this 
modification with a pluripotent stem cell state. 
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Figure 1 The presence of immunochemically detectable 5-hydroxymethylcytosine (5-hmC) levels correlates with pluripotency 
in mammalian cells. 5-hmC is detectable in mouse and human (RH1 and RCM1) ESCs. (A) Lines stained with 5-hmC anti- 
body. (B) 5-hmC and 5-mC staining of mouse ESC colonies (defined morphologically on the basis of phase images and in- 
dicated by dotted lines) and surrounding fibroblast-like differentiating cells. 5-hmC Immunostaining (C) and quantitative tran- 
scriptome deep sequencing (Solexa) data for Oct4, Nanog, Tet1, 2 and 3 (D) of RCM1 cells differentiating towards hepatic 
endoderm. Identical exposure times were used for different time points. Days post-induction are shown. RPKM - reads per 
kilobase per million reads. (E) High levels of 5-hmC are detectable in human iPSCs but not in human dermal fibroblasts (HDFs). 
DNA signal is indicated. Scale bars are 20 \im in A and E, 50 fim in B and 100 jim in C. The experiments were performed us- 
ing the Diagenode antibody. 



Global hydroxymethylation of the paternal genome coin- 
cides with the loss of 5-methylcytosine signal in zygote 

Several hours after fertilisation, chromatin of both ma- 
ternal and paternal pronuclei of the mammalian zygote is 
subjected to extensive genome-wide reprogramming as 
the embryo makes a transition from a germ cell to a so- 
matic developmental programme [2, 17]. One notable re- 
programming event is a dramatic loss of the 5-mC signal 
detectable by immunochemistry only in the paternal pro- 
nucleus after fertilisation [18]. It has been suggested that 
conversion of 5-mC to 5-hmC may represent an interme- 
diate step in the process of cytosine demethylation [4]. In 
agreement with a recent study [19], we could not detect 
any 5-hmC staining in the embryos before 2.5 h post- 
fertilisation, when a weak 5-hmC signal started to appear 
in the paternal pronucleus (Figure 2A, PN^, PN 2 . 3 ). The 



intensity of the 5-hmC signal increased dramatically dur- 
ing the next 2 h (Figure 2A, PN 3 . 4 , PN 4 ), peaking at 5-8 h 
post-fertilisation (Figure 2A, PN 5 , syngamy) coincident 
with a significant decrease and subsequent loss of 5-mC 
staining in the male pronucleus. The majority of embryos 
exhibited a strong 5-hmC signal, exclusively in the pater- 
nal pronucleus, whereas the maternal pronuclear 5-hmC 
staining was weak or undetectable (Figures 2A, Supple- 
mentary information, S4A, S4B). This corresponded to 
a strong 5-mC signal in the maternal pronucleus and its 
loss in the paternal pronucleus. Some embryos (11.4% 
at PN 4 _ 5 ) showed comparable levels of strong 5-hmC 
staining in both male and female pronuclei concurrently 
with equally low or absent 5-mC signals in both pro- 
nuclei (Supplementary information, Figure S4A, S4B). 
Nonparametric statistical analysis showed a significant 
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Figure 2 5-Hydroxymethylcytosine (5-hmC) distribution in mouse pre-implantation development. (A) Distribution of 5-hmC 
and 5-mC during indicated pronuclear stages in mouse zygote. Merged and phase views, where pronuclei are distinguishable 
at PN 3 -PN 4 stages, are shown. The exposure times were identical for each channel. 5-hmC antibodies produced by Active 
Motif were used. Scale bar, 10 |xm. (B) 5-hmC Distribution at the indicated later stages of mouse pre-implantation develop- 
ment. 5-hmC Antibodies produced by Active Motif were used. 5-mC signal and merge view are indicated. ICM is arrowed. 
Scale bar, 10 nm. 
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negative correlation (r = -0.9834, p < 0.0001) between 
5-hmC and 5-mC immuno staining scores (Supplemen- 
tary information, Figure S4A-S4C), suggesting that a 
decrease in 5-mC staining intensity is related to an in- 
crease in 5-hmC content in the mouse zygote at PN 4 . 5 . 
This most likely represents genome-wide 5-mC conver- 
sion into 5-hmC. Since 5-hmC is not detectable using an 
anti-5-mC antibody [4] and accumulates in the paternal 
genome at significant levels during the embryo's first cell 
cycle, together with a previous report, [19] these results 
suggest that the loss of 5-mC signal in the zygote at least 
partially represents a consequence of genome-wide 5-mC 
conversion into 5-hmC. 5-hmC-Independent demethy- 
lation of several genes has indeed been detected using 
bisulphate sequencing in the zygote [20], but bisulphite 
sequencing has never been performed on a genome- 
wide scale in preimplantation embryos. It is most likely 
that hydroxylation-independent demethylation of certain 
genome regions and global 5-mC hydroxylation occurs 
simultaneously in mouse embryos after fertilisation. 



High levels of 5-hmC colocalise with the ICM in blasto- 
cysts and with nestin-expressing cell populations in post- 
implantation mouse embryos 

To determine the dynamics of 5-hmC content in later 
mammalian development, we carried out whole-mount 
immunostaining of pre-implantation and sections of post- 
implantation mouse embryos. A detectable 5-hmC signal 
was present throughout all the stages of pre-implantation 
development, slightly decreasing by the 8-cell stage (Fig- 
ure 2B). Strong 5-hmC staining started to appear at the 
morula stage and was observed in mouse blastocysts with 
the maximum signal intensity in the ICM (Figure 2B and 
Supplementary information, Figure S5A). In contrast, the 
5-mC signal showed less perceptible variation between 
the ICM and trophoblast cells of blastocysts than did the 
5-hmC signal (Figure 2B and Supplementary informa- 
tion, Figure S5B). 

The 5-hmC enrichment in the ICM of mouse blasto- 
cysts is consistent with our immunochemical detection 
of 5-hmC in mESCs and previous TLC data [4, 5]. It is 
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also consistent with the expression of Tet proteins in both 
mESCs and the blastocyst ICM [4, 6]. Interestingly, the 
study of murine post-implantation embryos revealed a 
pattern of regional enrichment of 5-hmC nuclear staining 
correlated with the presumptive location of multi-potent 
progenitor or stem cells, becoming increasingly restricted 
with development (Figure 3A). At 10.5 days post-coitum 
(dpc) most embryonic tissues were highly enriched for 
5-hmC (Figure 3 A). At 12.5 dpc, mouse embryos ex- 
hibited intense 5-hmC staining in a range of embryonic 



tissues. Significantly higher levels of 5-hmC were ob- 
served in the brain, spinal cord, tailbud and liver but not 
in heart, skin or mesenchymal tissue (Figure 3A and 3B, 
Supplementary information, Figure S5C-S5G, Figure 
4A-4D). By 15.5 dpc, 5-hmC staining was generally less 
intense and limited to the brain and peripheral neural 
tissue (Figure 3A and 3B). According to 5-hmC signal 
quantification, the intensity of 5-hmC staining gradually 
decreased in brain tissue over time and fell dramatically 
in embryonic tongue between 12.5 dpc and 15.5 dpc 
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Figure 3 The dynamics of 5-hydroxymethylcytosine (5-hmC) in mouse post-implantational embryonic development. (A) 
Sagital sections of 10.5 days post coitum (dpc), 12.5 dpc and 15.5 dpc mouse embryos stained with Diagenode antibody to 
5-hmC. Immunostaining for single-stranded DNA is shown in red. (B) Fragments of brain and tongue (indicated) embryonic 
sagital sections at 10.5 dpc, 12.5 dpc and 15.5 dpc stages (indicated). Corresponding slides were immunostained for 5-hmC 
in parallel with identical conditions and imaged with the same exposure times. Cell populations of embryonic tongue enriched 
in 5-hmC at 15.5 dpc are arrowed. (C) A schematic showing of the results of quantification of 5-hmC signal in embryonic 
brain and tongue at 10.5 dpc, 12.5 dpc and 15.5 dpc. Corresponding slides were immunostained for 5-hmC in parallel and 
imaged with the same exposure times. Mean values for mean signal intensities of 10 random measurements of parts of im- 
ages shown in B are presented. The same central part of developing brain was quantified for all the stages analysed. Error is 
expressed as s.e.m. (D) The results of 5-hmC immunostaining of 3 adjacent sections of 12.5 dpc embryo brain region using 
primary antibody mix pre-incubated with PCR-produced DNA fragments where all the cytosines were replaced with either 
5-hmC (5-hmC DNA) or 5-mC (5-mC DNA). Control staining without pre-incubation with any DNA is shown (no DNA). The 
5-hmC immunostaining is specifically competed by 5-hmC- but not by 5-mC-containing DNA. Genomic DNA was visualised 
using single-stranded DNA antibody (indicated) used at a high titre. The experiments were performed using the Diagenode 
antibody. 
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(Figure 3B and 3C). Whereas high levels of 5-hmC were 
detected in most of the cells of embryonic tongue at 10.5 
dpc and 12.5 dpc, 5-hmC distribution in this organ at 
15.5 dpc was limited to certain cell populations (Figure 
3B). As expected, over the same period of development, 
the global level of DNA methylation was not restricted 
to any particular embryonic cell populations and did not 
vary significantly between embryonic stages (Supple- 
mentary infonnation, Figures S5C-S5G, S6). 

To identify these 5-hmC-rich cell populations, we 
performed immunostaining for nestin, an established 
marker of multi-potent progenitors and stem cells [21, 
22], in 12.5 dpc mouse embryos. Adjacent serial sections 
were used for nestin and 5-hmC or 5-mC staining, as im- 
munodetection of 5-hmC and 5-mC requires treatment 
of tissues with 4 N HC1, which is incompatible with the 
preservation of most protein epitopes. We observed that 
nestin staining closely correlates with the intense 5-hmC 
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signal in brain, peripheral neural tissue and the tailbud, 
suggesting that a high genomic content of 5-hmC is a 
feature of nestin-expressing multi-potent fetal cells (Fig- 
ure 4E and 4F). Consistent with our observed associa- 
tion of high levels of 5-hmC with multi-lineage potency, 
we also observed a highly localised 5-hmC signal in 
the olfactory epithelium of 17.5 dpc mouse embryos 
(Supplementary information, Figure S7). The olfactory 
epithelium is known to contain multi-potent stem cells 
constantly differentiating into supporting and olfactory 
cells in the embryo and adult [23]. 

The presence of high 5-hydroxymethylcytosine levels is 
restricted to neuronal tissue and bone marrow in adult 
human and mouse 

Finally, we evaluated adult murine and human tis- 
sues for genomic 5-hmC distribution. In agreement with 
previously reported data obtained by TLC [5], we could 
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Figure 4 5-Hydroxymethylcytosine (5-hmC) localises with nestin-expressing stem cell populations in 12.5 dpc mouse em- 
bryos. (A) Sagital section of 12.5 dpc mouse embryo with DNA visualised and B, C, D, E and F views indicated with dotted 
squares. Immunohistochemical detection of 5-hmC in sagital sections of mouse embryonic brain (B, C) and heart (D). Lo- 
calisation of 5-hmC and nestin in the sagital adjacent sections of tailbud (E) and a part of the spinal cord (F). DAPI signal in 
nestin immunostaining was colourised as red. Anti-single-stranded DNA antibody was used for DNA visualisation in 5-hmC 
immunostaining experiments. Scale bars are 150 |im in B, C and D and 300 nm in E and F. The experiments were performed 
using the Diagenode antibody. 
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Figure 5 The presence of immunochemically detectable 5-hydroxymethylcytosine (5-hmC) levels is restricted to bone marrow 
and neuronal tissues in adult mammals. A range of indicated normal adult human tissues from a Folio histological tissue array 
were immunostained with anti-5-hmC antibody. Phase views are shown. An identical maximal exposure of the images made 
using the 488 nm (5-hmC, tyramide) filter is shown for all the samples, except brain and bone marrow, to illustrate the pattern 
of background staining, which does not correspond to the DNA signal (indicated) in the majority of tissues. All the triplicate 
tissue samples on the array exhibited identical staining patterns. Six different bone marrow specimens were tested and gave 
essentially the same results. Representative views are shown. Scale bars are 20 urn, except 30 u,m in bone marrow sample. 
Essentially the same results were obtained with samples of ovary, testis, kidney, small intestine, salivary gland, uterus, uter- 
ine cervix, mesothelium, eye muscle, adrenal gland, pancreas and breast tissue (see Supplementary information, Figure S9). 
The experiments were performed using the Diagenode antibody. 



not detect any significant 5-hmC signal in mouse liver, 
skin, ovary, spleen or heart (Supplementary informa- 
tion, Figure S8), nor in any human tissue tested, except 
for brain and bone marrow (Figure 5 and Supplementary 
information, Figure S9). In mouse brain, strong 5-hmC 
nuclear staining was found not only in the hippocampal 
region and subventricular zone, which have previously 
been reported as regions of active neurogenesis contain- 
ing populations of neural stem cells (NSCs) and neural 
progenitors [24-28], but also in the neuronal tissue of the 
cortex and other parts of the brain and cerebellum (Fig- 
ure 6A-6C, Supplementary information, Figure S8B). In 
particular, in the hippocampus, higher levels of 5-hmC 
were observed in the dentate gyrus than the adjacent py- 
ramidal layer (Figure 6A), but this difference may be due 
to the relatively higher cell density of the former. Brain 
white matter did not display an intense 5-hmC signal 



(Figure 6A, 6B and Supplementary information, Figure 
S8B). Consistent with these results and in agreement 
with already published data [29], NSC derived in vitro 
from hESCs also exhibit strong 5-hmC staining, which 
does not disappear or decline after 18 days of neuronal 
differentiation (Figure 6D and 6E), but differentiation of 
progenitors towards oligodendrocyte lineages is charac- 
terised by the progressive loss of 5-hmC staining (data 
not shown). Notably, the transcripts of all 3 Tet genes 
are highly expressed in human NSCs, which make them 
distinct from human ES cells where only Tetl and Tet3 
transcripts are present at significant levels (Figure 6F). 

Discussion 

Our study shows a correlation between high levels of 
genomic 5-hmC and multi-lineage-potential during mam- 



Cell Research | Vol 21 No 9 | September 2011 



AlexeyRuzovefa/. @ 



1339 

ABC D 

Hippocampus SVZ Cerebellum NSC Neurons 18 days 




HDFs hESC NSC 



Figure 6 5-Hydroxymethylcytosine (5-hmC) is dramatically enriched in both neuronal progenitors and differentiated post- 
mitotic cells of neuronal lineages. (A) 5-hmC Is visualised by immunochemistry in hippocampal A and ventricular (B) regions 
of mouse brain and cerebellar folia (C). Pyramidal layer (pi), dentate gyrus (dg), subventricular zone (svz) and corpus colos- 
sum (cc) are indicated. Scale bars are 250 |im. The experiments were performed using the Diagenode antibody. (D) Human 
neural stem cells (NSCs) exhibit high levels of 5-hmC staining, which does not disappear after 18 days of differentiation into 
neurons. hESC-derived NSCs (indicated) and neurons derived from them after 18 days (neurons 18 days) of differentiation 
were immunostained for 5-hmC. 5-hmC, DNAand phase views are indicated. Scale bars are 20 urn The experiments were 
performed using the Diagenode antibody. (E) The upper panel shows a representative field of human NSC culture used for 
5-hmC immunostaining in D stained for Sox1 (red nuclear signal), DAPI (blue) and nestin (green cytoplasmic signal). The 
majority of cells co-express both markers. Scale bar is 50 |im. The middle panel shows a representative field of the neuronal 
culture obtained after 18 days of differentiation and used for 5-hmC immunostaining in D, stained for p-lll tubulin (green), 
Synapsin I (red in the colour image and presented as single channel in the grayscale image on the right) and DAPI (blue). 
The majority of cells co-express p-lll tubulin and Synapsin (scale bar is 50 |im). The lower panel shows the same stained 
population at a higher magnification (scale bar is 10 urn). (F) The results of real-time RT-PCR analysis of Tet1/2/3 transcripts 
(indicated) in HDFs, human H1 ES cells and obtained from them human NSCs (indicated). The transcripts of all three Tet 
genes are highly expressed in human NSCs. The data were normalised with relative to GAPDH. Error is expressed as s.e.m. 



malian development and in a number of adult tissues. 
High 5-hmC level is a feature of induced or embryonic 
pluripotent stem cells. Nestin-expressing fetal stem cells 
in post-implantation development, and stem cell-rich tis- 
sues including adult bone marrow and nasal epithelium 
of the late development mouse embryo also exhibit high 
5-hmC levels, whereas the actively proliferating unipo- 
tent cells of adult mammalian skin or liver do not. Some- 
times it appears that a strong 5-hmC signal coincides 

www.cell-research.com | Cell Research 



with stronger DNA staining (SVZ, olfactory epithelium), 
but it is worth noting that a higher DNA content is an 
expected feature of some rapidly proliferating stem cell 
populations, wherein a significant proportion of cells is 
present in the S-phase of the cell cycle. A more detailed 
analysis will be required to determine if high 5-hmC is 
being utilised by other adult tissue-specific stem cells. 
According to our study, neuronal lineages seem to be 
exceptional in their maintenance of high 5-hmC levels 



@ 5-hmC developmental distribution in mammals 



1340 



TOTIPOTENCY 



Zygote 



<j> Low 5-hmC 
(J High 5-hmC 



PLURIPOTENCY. MULTIPOTENCY 



I 
I 

Blastocyst ICM I 



Foetal and 
adult stem 
cells and 
progenitors 



DIFFERENTIATED 

Neuronal 
cells . 




High gencfnic 5-hmC 



Low 5-hmC 



Figure 7 A graphical summary illustrating 5-hydroxymethylcy- 
tosine (5-hmC) distribution in mammalian development. High 
levels of 5-hmC are marked with green shading. 



in differentiated, post-mitotic cells. In agreement with 
this finding it has been previously reported that dif- 
ferentiation of mESCs into neuronal progenitors and 
subsequently into mature neurons has been shown to be 
accompanied by relatively few changes in DNA methyla- 
tion, most of which occur during the first stage of differ- 
entiation [30, 31]. Thus, neurons may represent a unique 
mammalian post-mitotic differentiated cell type, with a 
chromatin organisation similar in some aspects (5-hmC 
enrichment) to that of stem cells. Our analysis of 5-hmC 
distribution in adult mouse tissues is supported by other 
reports on 5-hmC tissue quantification showing that 
5-hmC is strongly enriched in the central nervous sys- 
tem, compared to other adult organs [5, 12, 14]. Our data 
are also generally in agreement with TLC-based data 
[5], an isotope-based chromatographic method for quan- 
tification of 5-hmC [12], and with immunolocalisation 
of 5-hmC in mouse brain presented in the same study 
[12]. In contrast to Globisch and colleagues, our results 
do not reveal immunochemically detectable 5-hmC in 
adult mouse liver and kidney, but the resolution of the 
corresponding figures in that report does not allow us to 
draw conclusions on the specificity of the observed stain- 
ing in these tissues. It is also important to note that there 
is a slight discrepancy between the study of enzymatic 
quantification of genomic 5-hmC [14], where its levels 
in mouse lung and, especially, in kidney were similar to 
those in brain, and both our immunochemical data and 
the other published data, based on biochemical methods 
[5, 12], where the 5-hmC content of lung and kidney was 
reported to be much lower than that of neural tissues. It 



is possible that these differences stem from the nature 
and sensitivity of the methods used to quantify genomic 
5-hmC in the corresponding reports. 

Previous studies have established that 5-mC- and 
5 -hmC -containing DNAs have different protein binding 
properties; for example, the interaction of MBD proteins 
with 5-mC is abolished by hydroxymethylation [7, 11]. 
This implies that these two epigenetic modifications 
perform distinct biological functions. It is possible that 
an elevated genomic content of 5-hmC could contribute 
both to the general "openness" of the chromatin in stem 
cell and neuronal nuclei, and to the regulation of expres- 
sion of specific target genes during stem cell self renewal 
and differentiation. Further work is necessary to reveal 
the identity and roles of any 5-hmC binding proteins, and 
identify genes affected by hydroxymethylation of cyto- 
sine residues in tissue-specific contexts. 

In conclusion, we show that unlike 5-mC, genomic 
5-hmC is significantly enriched in embryonic contexts, 
cells with multi-lineage potency and neurons in mam- 
mals (Figure 7). To our knowledge, 5-hmC is the first 
epigenetic modification shown to possess such a cell- 
type-specific enrichment in its distribution. 

Materials and Methods 

ES cell culture and iPSC generation 

RH1 and RCM1 hESCs have been described previously [32, 
33]. Mouse HM1 ESCs were maintained on gelatin-coated dishes 
in Glasgow's Minimum Essential medium (GIBCO), supple- 
mented with 15% fetal bovine serum, 55 uM (3-mercaptoethanol 
(GIBCO), 2 mM L-glutamine, 0.1 mM MEM non-essential amino 
acids, 5 000 units ml 1 penicillin/streptomycin and 1 000 units ml -1 
of murine LIF (Chemicon) under feeder-free conditions. hESCs 
and hiPSCs were cultured on matrigel-coated plates and slides in 
feeder-free mTeSRl medium (STEMCELL Technologies). Cells 
were passaged using collagenase IV (200 units ml in DMEM, 
GIBCO) and mechanical dissociation. hESC differentiation to he- 
patic endoderm was performed according to the published protocol 
[15]. hiPSCs were produced as described previously [16] and vali- 
dated by flow cytometry, embryoid body formation, differentiation 
into endoderm, mesoderm and ectoderm and RT-PCR as described 
[32]. 

Immunocytochemistry, immunohistochemistry, pre-implan- 
tation embryo culture, imaging and dot blot assay 

For immunocytochemistry, cells were fixed in 4% formalde- 
hyde for 15 min. Paraffin-embedded formaldehyde-fixed sections 
of wild-type CD1 mouse embryos and adult tissues were used for 
immunohistochemistry. A normal human tissue histological array 
(Folio) was used for 5-hmC detection in human tissues. Tissue sec- 
tions were de-waxed according to standard procedures. Cells and 
tissue sections were permeabilised for 15 min with PBS containing 
0.5% Triton X-100. For 5-hmC and 5-mC staining, permeabilised 
cells and tissue sections were incubated in 4 N HC1 for 1 h at 37 
°C and then neutralised in 100 mM Tris-HCl (pH 8.5) for 10 min, 
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followed by a standard immunostaining protocol. 

Anti-5-hmC (Diagenode, 1:1 000 dilution; Active Motif, 1:5 000; 
1:50 000 dilutions), anti-5-mC (Eurogentec) and anti-nestin clone 
401 (DSHB) primary antibodies were used. DNA was visualised 
using an anti-single-stranded DNA antibody (Zymo research). 
Peroxidase-conjugated anti-rat secondary antibody (Dako) and 
the tyramide signal enhancement system (Perkin Elmer) were em- 
ployed for 5-hmC detection. 

Mouse embryos were produced by mating superovulated F 1 
females with CD1 males, and cultured according to standard pro- 
cedures. Anti-5-hmC (Active Motif, 1:5 000 dilution) and anti- 
5-mC (Eurogentec, 1:200 dilution) antibodies were used for im- 
munochemistry. Pre-implantation embryos were immunostained 
as described [34] with the use of peroxidase-conjugated second- 
ary antibody and the additional tyramide (Perkin-Elmer) signal 
enhancement step for 5-hmC staining. Control staining without 
primary antibody produced no detectable signal. Images were 
acquired using a Zeiss Axiovert 200 immunofluorescence micro- 
scope and Axiovision software. Dot blot assays were performed as 
reported previously [10]. 

Image quantification 

Image quantification was perforated using Fiji software. Cor- 
responding slides were processed in identical conditions and 
were imaged at the same exposure settings. Mean intensities were 
measured for 10 random squares on each region of interest for 
each sample. The 5-hmC signal was standardised to the total DNA 
signal. Mean values of the mean intensities are presented. Experi- 
mental error is expressed as s.e.m. 

Statistical analysis 

We used a numerical grading system 0 (absent), 1 (low) and 
2 (high) to account for differences in 5-hmC and 5-mC staining 
intensities in maternal and paternal pronuclei of the mouse zygotes 
(the examples of the categories are arrowed on Supplementary 
information, Figure S4A). The statistical analysis was performed 
using GraphPad Prism software. The individual 5-hmC and 5-mC 
immunostaining scores (total number of xy pairs = 70) of both 
maternal (n = 35) and paternal pronuclei in = 35) were correlated 
using Spearman's nonparametric test. The P value was considered 
significant for P < 0.05. 

Solexa deep sequencing 

Total RNA was isolated from hESCs at the times indicated after 
induction of differentiation and sequenced directly using a Solexa 
1G system. The Solexa results were normalised as follows: the raw 
count reads of each gene were first divided by the total number 
of aligned reads multiplied by 1 million to take into account the 
depth of sequencing. The results were then divided by the length 
of each gene multiplied by 1 000 to get an RPKM value (read per 
kilo bases per million reads). 

Neural stem cells 

Neural stem cells were generated from p57 H9 hESCs and 
cultured as described [35]. NSCs' identity was confirmed by im- 
munostaining for Pax6, Sox2, Soxl and Nestin. H9 hESCs were 
cultured on Matrigel with TeSR2 (STEMCELL Technologies) 
medium. Differentiation was performed in Neurobasal medium (N2 
supplement 1:100). 



Real time RT-PCR 

Total RNA was extracted with TriReagent (Sigma). Samples 
were reverse transcribed using random primers (Promega) and 
Superscript II RT (Invitrogen). Products were detected using 
SYBR Green PCR Mastermix (Applied Biosystems) and a PTC- 
200 cycler with a Chromo-4 detection system (MJ Research). Data 
were normalised with relative to GAPDH. Error is expressed as 
s.e.m. Primer sequences for human Tet proteins are available upon 
request. 

Immunostaining competition experiment 

Immunostaining competition experiments were performed 
as described [12] using PCR-produced 100-bp DNA fragments, 
where all the cytosines were replaced with either 5-hmC or 5-mC. 
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